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ABSTRACT. Kinematical and dynamical properties of chaotic systems are reviewed and a few applications 
are described. 



§1 Kinematics. 

Gases and fluids are described mathematically by dynamical systems. This means that 
the system state is a point in a phase space C on which the motion is given by a map S. 

Typically C is a finite dimensional smooth finite (i.e. compact) manifold and S is a local 
diffcomorphism of C. But sometimes C and/or S have singularities (think for instance of 
hard sphere systems) located on a closed set N of zero volume: in such cases the points of 
C/N are the regular points where S is smooth, of class C°°, with non vanishing jacobian 
determinant and N will always contain the boundary of C, if any. 

The transformation S can be invertible: this means that there is another map S" -1 with 
a singularity set N' (with volume) such that if x ^ N and Sx ^ N' then S~ 1 Sx = x, 
and if x ^ TV' and S~ l x £ N' then SS^x = x. 

Unless explicitly stated we always suppose the maps we consider to be invertible and 
without singularities. 

If there is a smooth map i of C into itself such that i 2 = 1, and iS — S~ x i we say thgat 
the system is time reversal symmetric. We can always suppose that i is an isometry 
(possibly changing the distance d(x,y) to d(x,y) = d(x,y) + d(ix,iy)). 

The main question is: given (C, S) and picking x € C randomly with a distribution /in. 
proportional to the volume dx: (j,o(dx) — po(x) dx (for some po € L\{C,dx)), which is 
the asymptotic behaviour of the motion starting at x? 

This means asking what can one say about the averages: 

lim ~Y l f(S*x) = {f) ± (1.1) 

3=0 

In the case of a non invertible transformation one chooses, of course, only the sign + in 
(1-1). 

We shall always suppose that the averages exist for /in-almost all x's: this assumpiton 
is called 0-th law. The necessity of the clause "/in-almost everywhere" is because in 
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essentially all cases there are obvious exceptions. Typically each periodic orbit is an 
exception and usually periodic orbits form a dense set in phase space (the cases with no 
periodic orbits, as the quasi periodic motions, must be regarded as exceptional). 

The independence on x of the r.h.s. of (1.1), i.e. the uniqueness of the averages, is not a 
very strong assumption. Usually it fails because C breaks into open parts U\ , U2 , ■ ■ ■ whose 
union has full volume and in each of them the asymptotic behavior is unique, i.e. for 
each j one has x- independence almost everywhere of the average ^o - &hnost everywhere 
in Uj. Thus the system would, essentially, break down into several systems each of which 
verifies the 0-th law. 

The averages (f)± can be written: 

(f)± = J mi*±(dy) (1.2) 

where fi± is a probability distribution that will be called the forward statistics (+) and 
the backward statistics (— ) of no or the statistics of the motions with respect to initial 
data choices at random with respect to the volume in phase space. 
Are there systems (C, S) for which fi can be determined? 

(1) Hamiltonian systems: in this case the phase space is the space of the configurations 
and momenta x of given energy that verify some "timing property", for instance x lies 
on a certain surface signaling some interesting event, e.g. a collision. If t(x) is the time 
interval between the "event" x and the next one of the same type the map S is cxprcssscd 
in terms of the solution flow t — ► St of the equations of motion by S = S t ( x -) . The Liouville 
theorem gives us a probability distribution ^ which is proportional to the volume on C 
and it is invariant: Ho(E) = fio(S~ 1 E) for all sets E for which it makes sense to measure 
the volume ("volume measurable sets"). 

The ergodic hypothesis, when reasonable, tells us that /zo is the statistics of itself: fi + = 
Mo, [!]■ 

(2) Transitive Anosov systems: in this case the phase space is a smooth manifold (say 
C°° ) and through each point x pass a smooth stable manifold W x and a smooth unstable 
manifold W x such that their tangent planes T X ,T X at x: 

(i) are S'-covariant (i.e. dST™ = Tg x ,dST x = T§ x ), vary continuously with x, in the 
sense that T X ,T X are Holder continuous as functions of a:, 1 and the full tangent plane is 
the direct sum of T*, T x . 

(ii) for all n > 0: 

(a) \\dS n v\\ s „ x <Ce-^\\v\\ x veT* 

(b) \\dS- n v\\s^<Ce~ nX \\v\\ x v € T x 

(iii) There is a periodic point with stable and unstable manifolds dense in C. 

It then follows that all points have dense stable and unstable manifolds and that the 
periodic points are dense. 

For such systems the 0-th law holds as a mathematical theorem, [2], [3], p. 757. 

One might prefer to require real smoothness, e.g. C p with 1 < p < oo: but this would be too much for rather 
trivial reasons. On the other hand Holder continuity might be equivalent to simple C°— continuity as in the 
case of Anosov systems, see [4], [3]. 
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(3) Transitive Axiom A attractors: Suppose that (C,S) is a smooth system and S is a 
smooth invariant surface which is attracting in the sense that the distance of S n x from 
S tends to exponentially: d(S n ,S) < Ce- Xn d{x,S) for all x in the vicinity of S (with 
C, A > suitable constants). If (S, S) is a transitive Anosov system then this is an 
example of a transitive Axiom A attractor. More generally an Axiom A attractor needs 
not be a smooth manifold; it can be a non smooth closed attracting set A provided: 

(a) at each point x G A the tangent space to C can be decomposed into two planes T®, T" 
so that (i), (ii) above hold, and (iii) is replaced by the requirement that on A there is a 
periodic point with dense stable and unstable manifolds. 

For such systems the 0-th law holds and the statistics /i± are called SRB distributions. 

In general we call attracting set a closed invariant set A such that d(S n x,A) n _ >+oc > 
for all x sufficiently close to A, and furthermore no subset of A enjoys the same property. 
A repeller is defined analogously by using S* _1 instead of S. 

Note that if A is an Axiom A attractor then the planes are " tangent to the attractor" 
in the sense that they are everywhere tangent to a smooth invariant surface contained 
in A. But the plane are not tangent to A: in fact part of them sticks out of A 
(describing the fall on the attractor of the nearby points) and part of them lies in some 
sense on the attractor and describes motions that stay on the attractor and are homoclinic 
as n — > +oo, i.e. approach each other asn^ +oo. 

In general through every point in an Axiom A attractor pass two smooth manifolds 
W x s , W s " which are tangent to T£,T£. 

(4) Axiom A systems: A notion more general than the previous one can be given by 
introducing the non wandering points: they are the points x such that any of their neigh- 
borhoods returns infinitely often near x. This means that for any pair of neighborhoods 
U, V of x there are infinitely many n > such that S n V n U ^ 0, [3] p.749. 

A smooth system (C, S) verifies Axiom A if the set f2 of nonwandering points is hyper- 
bolic, i.e. it verifies property (i), (ii) of the previous example (2) and (iii) is replaced by 
the requirement that the periodic points are dense, [3] p. 777, [5] p. 154. 

If an Axiom A system has an attractive set then it is called an "axiom A attractor" 
(not necessarily transitive, however see below). In general through every non wandering 
point in an Axiom A system pass two smooth manifolds W*, W" which are tangent to 

1 x i 1 x ■ 

The nonwandering set of an Axiom A system splits into a finite union of invariant sets, 
called basic sets, {flj} on each of which there is a dense orbit ("topological transitivity", 
not to be confused with the above notion of transitivity). And each of such sets splits 
into a finite union of sets {Bjk} each of which is invariant for an appropriate iterate S Pj 
which acts transitively on it (in the sense of the previous examples: namely the stable 
and unstable manifolds of a periodic point are dense). Thus if Slj is an attractor each 
Bjk is an Axiom A transitive attractor for S Pj ). 

The 0-th law holds for the attractors of an Axiom A system. 

(5) Axiom C systems: The Axiom A and the zeroth law properties could be reasonably 
taken as properties characterizing models for globally " chaotic" or " globally hyperbolic" 
systems. But in Axiom A systems there are no relations between the different basic 
sets. And therefore other more global kinematic notions have been considered in the 
literature: we shall not dwell here on the Axiom B notion and we deal directly with 
the Axiom C notion that is particularly appropriate for invertible maps enjoying a time 
reversal symmetry. In fact the problems that we pose in the next sections suggest that 
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the appropriate notion for "globally hyperbolic" or "globally chaotic" dynamical systems 
is somewhat stronger than that of Axiom A or B. 

Axiom C systems are Axiom A systems enjoying further properties. To describe them 
we introduce the notion of distance of a point x to the basic sets {Qi} of an Axiom A 
system as: 

o(x) = mmjmm — , ml — — } (1.4) 

i do j, -oo<n< + oo d 

where do is the diameter of the phase space C, dn i (x) is the distance of the point x from 
the basic set Hi , the minimum over i runs over the attracting or repelling basic sets and 
the minimum over j runs over the other basic sets (if any) . 
We can then define the Axiom C systems as: 

A smooth dynamical system (C, S) verifies Axiom C if it verifies Axiom A and: 

(1) among the basic sets there are a unique attracting and a unique repelling basic sets, 
denoted f2 + ,f2_ respectively, with (open) full volume dense basins of attraction. We call 
fl± the poles of the system (future or attracting and past or repelling poles, respectively). 

(2) for every x £ C the tangent space T x admits a Holder-continuous decomposition as 
a direct sum of three subspaces T",T*,T™ such that: 

a = ii, s, to 
\w\, w £T£, n>0 

\w\, w £T£, n > 

-l e -A|n|| w | ; w g T m ; y n 

where the dimensions of T", T*, T™ are > and 8(x) is defined in (1.4); here dS n is the 
jacobian matrix of S n . 

(3) if x is on the attracting pole f2+ then T£ © is tangent to the stable manifold in 
x; viceversa if x is on the repelling pole S!_ then T" © T™ is tangent to the unstable 
manifold in x. 

Although T" and are not uniquely determined by the above definition the planes 

© T™ and T" © T™ are uniquely determined for x £ f2+ and, respectively, x £ 0_. 
The above notion of Axiom C is in [6] . It is clear that an Axiom C system is necessarily 
also an Axiom A system verifying the 0-th law. 

If f2+ and f2_ are the two poles of the system the stable manifold of a periodic point 
p £ Sl + and the unstable manifold of a periodic point ? e !!_ not only have a point 
of transversal intersection (this would be the property characterizing Axiom B systems 
among the Axiom A ones with only two basic sets) but they intersect transversally all 
the way on a manifold connecting Q + to the unstable manifold of a point in f2_ will 
accumulate on fi + without winding around it, [6]. 

In fact one can "attach" to W^, p £ points on f2_ as follows: we say that a point 
z £ 0_ is attached to W£ if it is an accumulation point for and there is a closed 
curve with finite length linking a point z £ to z and entirely lying on W£, with 
the exeption of the endpoint z. A drawing helps understanding this simple geometrical 
construction, slightly unusual because of the density of W£ on 

We call W S p the set of the points either on Wp or just attached to Wp on the system 

basic sets (the set Wp should not be confused with the closure clos(W^), which is the 

whole space, see [3], p. 783). If a system verifies Axiom C the set Wp intersects 0_ on 
a stable manifold, by 2) in the above definition. 

The definition of W q , q £ 0_, is set symmetrically by exchanging + with 0_. 
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Furthermore if a system verifies Axiom C and p € f2+, q € f2_ are two periodic points, 
on the attracting and on the repelling pole of the system respectively, then W p and W q 
have a dense set of points in fi_ and f2+, respectively. 

Note that W p DW q is dense in C as well as in fi + and This follows from the density 
of Wp and W q on f2 + and f2_ respectively and from the continuity of T™. Furthermore 
if z € + is such that z £ W' p n then the surface VFp n W q intersects in a unique 
point z = iz which can be reached by the shortest smooth path on W p n linking z to 
C_ (the path is on the surface obtained as the envelope of the tangent planes T m , but 
it is in general not unique even if T m has dimension 1, see the example in Appendix Al 
below) . 

The map i commutes with S, squares to the identity, maps f2+ to f2_ and viceversa and 
will play a key role in the following analysis. 

In [6] it is conjectured that the Axiom C systems are fi-stable in the sense of Smale, [3] 
p. 749, i.e. small perturbations of Axiom C systems are still Axiom C systems. 

$2 Dynamics and chaotic hypothesis 

We begin with a few examples of systems that we would like to study. 
Example 1: We shall consider the problem of electrical conduction in a crystal via the 
classical model representing the crystal as a periodic array of circular obstacles among 
which free charges, also modeled by hard spheres, move undergoing elastic collisions with 
the obstacles and between themselves. 

An electric field acts upon the moving particles establishing a current. We suppose that 
the obstacles are such that there is no straight line that avoids them. Nevertheless clearly 
the electric field will continue to work so that the electric current will grow unbounded 
and therefore the system will show infinite conductivity. 

The reason why electrical conductivity is finite in physical systems is simply that there 
are dissipative effects. The simplest theories of conductivity, like Drude's theory, write 
directly a dissipative equation for the motion of the N moving particles contained in a 
crystal with cell of size a: 

P ■ 

q. = —, P • = Ei+Ei-vp . (2.1) 

where v = 2£v~ 1 if i is the mean free path (equal to I = (Anpd 2 )^ 1 if d is the radius of the 
particles and p = N / a 3 their density) and v is the average velocity, Fj is the (impulsive) 
force acting on the j-th particle. This special choice of v corresponds to a dissipation 
proportional to the speed: no special physical meaning should be attached to this choice 
which is here used only as an example. 

Periodic boundary conditions are imposed at the cell boundary in the direction parallel 
to the field and reflecting conditions are imposed in the direction orthogonal to the field 
(semiperiodic boundary conditions), to fix the ideas. 

The above are not the only equations that we can use for the conduction problem: in 
fact the coefficient v has an empirical nature and it is supposed to model a thermostat 
action that forbids the system to reach arbitrarily high current. 

At the same level of rigor one could equally well maintain that the thermostat action 
is that of keeping the total energy of the moving particles constant; hence one can think 
of imposing the anholonomic constraint J2i P 2 = cons t an d the constant should be 
£ = N(^ksT + \mv 2 ) if T is the absolute temperature, ks the Boltzmann constant and 
v is the average drift velocity. 
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Gauss asked himself what would be the way of imposing a nonholonomous constraint 
affecting in a " minimal fashion" the dynamics and formulated the well known extension of 
D'Alembert's principle called the least constraint principle. If the constraint of constant 
energy is imposed on the above system, by Gauss' principle, the resulting equations take 
the form (2.1) with v replaced by a multiplier a given by: 

Ei p . 

"(g) = V , a "' (") 

A brief computations shows that indeed if in (2.1) one substitutes v — > a the solutions of 
the new equation (2.1) keep a constant total (kinetic) energy. This model was considered 
for instance in [7]. 

Of course the value of the constant given to the energy cannot be arbitrary. Suppose 
that the dissipative effects modeled by v are such that when the external field is E then 
the system relaxes to a stationary state in which the average energy is £. Then if we want 
that the model (2.1) or the same model with v replaced by the a of (2.2) to be equivalent 
we should fix the energy in the second model to be exactly £. For large systems the two 
models should be equivalent, [8]. 

Example 2: A more physical model for electric conduction is: 

q = p/m, p = Fj-0p + mB j (t)+Ei_ (2.3) 

— 3 — 3 — 3 J — 3 J 

where Bj is a brownian motion. This is a Langevin equation in which [3 is a friction 
coefficient (e.g. v above) and the dispersion 5 of each component of B • is related to the 
temperature (by = \k B T , see [9], p. 55). 

The above equation is clearly phcnomcnological and one should think that the same 
physical system may be described in a different way, for instance by the system of equa- 
tions: 



lj = Ej/ m { t jk = B jk 

Pj= Ej-ap.+mB^+Ei ' { B jk = -d b _ .V{{b jlk ,}) 

def 



(2.4) 



k 

where the multiplier a is determined by imposing that the total kinetic energy £ 
J2j 2mP 2 01 the charged particles stays constant: 



<J" 2m tL j 

EjiEi-B^-p 



(2.5) 



and the (b, B_) coordinates describe a hamiltonian system with many degrees of freedom 
(i.e. the label k takes many values, e.g. k » N) which acts as a heat reservoir on the 
system of particles. 

Again the equivalence between the two equations can hold only if the value £ of the 
energy in (2.4) is fixed equal to the average value of the energy of (2.3) and, at the same 
time, the motion of the (hamiltonian) dynamical system for the evolution of the variables 
b, B_ is so chaotic that the variables B_- can be reagrded as a gaussian process whose 
covariance is the same, or close, to that of the homomimous stochastic process in (2.3), 
[8]. 
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Example 3 As third and last example we shall focus on fluid mechanics problems consid- 
ering a fluid that: 

(1) is enclosed in a periodic box f2 with side L, possibly with a few disks ("obstacles") 
removed so that no infinite straight path can be found in fl that avoids the obstacles, 

(2) is incompressible with density p. 

I shall consider two distinct evolution equations for this fluid, both of dissipative nature. 

u + u ■ d u = —-dp + g + i/Aw, d ■ u = NS 

.... ., 

u+u -d u= — dp+g+(3Au, d-u=0 GNS 
~ ~ p 

In the case fl contains obstacles a "no friction" boundary condition will be imposed on 
dSl, i.e. u ■ n — if n is the normal to <9f2. The first equation is the well known Navier 
Stokes equation with v being the viscosity. 

The second equation, introduced in [8] and called the gaussian Navier Stokes equation 
or GNS equation, has a multiplier (3 defined so that the total vorticity nL 3 = pj lu 2 dx, 
with uj = d A u being the vorticity, is a constant of motion; this means that: 

J {9 A g ■ u + to ■ (w ■ d u)) dx 

/?(«) = " f ,o a ^ & + ft (2-7) 

J (o A lu Y dx 

The above equations were studied in [10]. We shall consider only the truncated equa- 
tions with momentum cut off K . In this way the existence and uniquenes problems are 
completely avoided. 

The truncation is performed on a suitable orthonormal basis for the divergenceless fields 
in f2: given the boundary conditions we consider it natural to use the basis generated by 
the minimax principle applied to the Dirichlct quadratic form Jq(8 u) 2 dx defined on 

the space of the C°°(Q) divergenceless fields u with u ■ n = on dtt. The basis fields 
Uj will verify: = —EjUj + d^p, for a suitable multiplier pj, with Uj,Pj € C°° 

and Ej are eigenvalues). Thus truncating at momentum K means setting identically 
the components on Uj with Ej > K 2 . 

For instance in the case of no obstacles let the u = J2k^o 1 k el ~'~ be the velocity 

field represented in Fourier series with 7 = ~_ (reality condition) and k ■ 7 =0 

(incompressibility condition); here k has components that are integer multiples of the 
"lowest momentum" ko = Then consider the equation: 

i k =-$(k)l k -i Yl ilk/ ^)^kl k2 + 9 k (2.8) 

k ~\~ 2 — k_ 

where the k 's take only the values < \ k\ < K for some momentum cut-off K > and 
rife is the projection on the plane orthogonal to k. This is an equation that defines a 
"truncation on the momentum sphere with radius K of the equations (1.1)" if: 



d{k) = -i>k 2 NS case 
tf(k) = -pk 2 GNS case 



■2 ( 2 - 9 ) 



For simplicity we may suppose, in this no obstacles cases, that the mode k = is absent, 
i.e. 7 = 0: this can be done if, as we suppose, the external force g does not have a 

zero mode component (i.e. if it has zero average). 
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In order that the resulting cut-off equations be physically acceptable, and supposing 
that g =/= only for \ k\ ~ fco, one shall have to fix K large. For instance in the NS 

case it should be much larger than the Kolmogorov scale K = (r]L'~ 2 ) 1 / 4 , where wq is the 
average dissipation of the solutions to (2.6) with K = +00 (determined on the basis of 
heuristic dimensional considerations by n ~ \ g \ L 2 v~ 2 ): sec [11]. 

It is easy, in the no obstacles cases, to express the coefficients (3 for the cut off equations 

as 13 EE /3 e + ft: 



E* * 4 ll> " 

2 (2-10) 

-»Efc 1+ fc 2+i3 ^o fc 3 (7fc i ' fc 2 )(7fc 2 ' 2 k ) 



where the fc 's take only the values < \k \ < K for some momentum cut-off K > and 
life is the orthogonal projection on the plane perpendicular to k. 

The above two equations should be again equivalent in the natural sense that the sta- 
tionary distribution [i v . ns of the (truncated) NS equation with viscosity parameter v and 
the stationary distribution [i v ,gns of the GNS equations should give the same averages to 
" most observables" if the value 77 is fixed to be the same as the average of the dissipation 
in NS equations, [10]. 

A key remark suggested by the above equivalence statements, which are in fact only 
conjectures, is that the stationary distributions can be regarded as statistical ensembles 
and the conjectures state that the same thermostatting mechanism can be equivalently 
modeled by completely different equations. 

Furthermore the same thermostatting mechanism can be modeled by a "usual" irre- 
versible equation of motion and also by a reversible one. 

Much in the same way as in equilibrium statistical mechanics the same system can be 
described by the microcanononical or the canonical ensembles. Of course as in classical 
equilibrium mechanics the ensembles are not completely equivalent as one can always find 
quantities that have a completely different behavior in two "equivalent" systems (e.g. the 
energy fluctuations are quite different in the canonical and microcanonical ensembles even 
when they are equivalent). 

The real problem is, however, how to find the stationary distributions? 

This has been an outstanding problem in the last century: a solution for it was proposed 
in 1973 by Ruellc but it went unnoticed. Probably because it was formulated (later, 
e.g. in [12], and) in a way that a concrete application seemed far away. 

Recently the Ruelle's principle has been shown to imply far reaching consequences. We 
therefore proceed to formulate it in a modern form having in mind to apply it (as an 
example) to the systems (C,S) obtained by timed observations on the above described 
family of systems. 

In [13], [7], [14] the above systems, at least when chaotic i.e. when showing at least one 
positive Lyapunov exponent, are supposed to verify following hypothesis: 

Chaotic hypothesis: A reversible many particle system in a stationary state can be re- 
garded as a transitive Anosov system for the purpose of computing the macroscopic prop- 
erties. 

This means that the attractor verifies Axiom A and it is assumed to be smooth and 
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hyperbolic in the strict mathematical sense described in the kinematics section. At zero 
forcing we suppose, for compatibility with the ergodic hypothesis, the system to be a 
transitive hamiltonian Anosov system so that the attractor is the full phase space. 

In the quoted references it is argued that the chaotic hypothesis should be considered 
in the same way as the ergodic hypothesis in equilibrium statistical mechanics. It is 
assumed as correct even in cases in which it cannot be mathematically strictly valid: 
but this can be done only for the purpose of deriving statistical properties of a few 
relevant observables. An analogue of this procedure is the derivation of the second law 
(heat theorem) from ergodicity (i.e. from the microcanonical ensemble) in equilibrium 
statistical mechanics: the law is derived by supposing ergodicity and it is assumed valid 
even when the ergodic hypothesis is obviously false (e.g. for the free gas in a box), [1]. 

It might be surprising that something can be concretely derived from this assumption: 
in fact it is very ambitious and far reaching and we shall see it has implications. 

And it is clear that any consequence of such a general assumption must be a param- 
eterless prediction, i.e. a universal law. Examples of this type of deductions are well 
known: the most remarkable is perhaps the mentioned heat theorem of Boltzmann which 
derives the second law of thermodynamics from the ergodic hypothesis (i.e. from the 
microcanonical ensemble). 

In noncquilibrium thermodynamics there are no universally accepted laws that we could 
use as a reference and a test of the theory: except perhaps the Onsager reciprocity and 
the fluctuation dissipation theorems. The latter are derived in very many different ways 
and there is a general agreement about their correctness, see [15]. 

The Onsager reciprocal relations and the fluctuation dissipation relations (also called 
Green-Kubo formulae) have been subject of many unsuccessful, or only partially success- 
ful, attempts to extend them to non zero (or large) external forcing. In fact they "only" 
express relations between derivatives with respect to external fields evaluated at fieldsl 

We shall argue that the above chaotic hypothesis not only implies the Onsager relations 
(when they are valid, i. e. at zero fields) but that it also leads to an extension of them, in 
the form of a general parameterless statement, that will be called the fluctuation theorem. 

§3 Reversible dissipation and the fluctuation theorem. Markov partitions (coarse graining 
and symbolic dynamics). 

Note that the examples in §2 are pairs of dynamical systems which are conjectured to 
represent the statistical properties of the same system and the pairs have been delib- 
erately selected so that each pair consists of a manifestly dissipative equation and of a 
manifestly reversible equation. 

It is easy to check that the second equation of the example 1) is reversible: the operation 
i : (p , q) — > (— p , q) induces a time reversal in phase space in the sense that the solution 
flow S t verifies iS t — S- t i. This implies that the timing map S (on any time independent 
timing event) verifies iS = S~ 1 i. 

Likewise the map i : (p , B_, q, b) — > (—p, — B_, q, b) induces a time reversal symmetry 
in the phase space of the second equation in the second example. 

And the second equation of the third example admits also the time reversal symmetry 
i : u — * — u- 

In this section we shall restrict attention to such reversible systems and we shall suppose 
that i is an isometry (not restrictive, see §1) and that the chaotic hypothesis is verified. 

Let a; be a point on the phase space C of the systems that we study through timed 
observations and therefore are dynamical systems (C, S) of the type considered in §1. We 
define the jacobian matrix J(x) = dS(x) and regard it as a linear map of the tangent 
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plane T x onto T$ x - We also define the action of dS(x) on the stable and unstable planes 
T X ,T X as a linear map J u (x) or J s (x) onto Tg x ,T§ x . 

We call A u (x), A s (x) the determinants of the jacobians J u (x), J s {x): their product differs 
from the determinant A{x) of dS{x) by the ratio of the sine of the angle a{x) between 
the planes T X ,T X and the sine of the angle a(Sx) between T^ X ,T^ X . We set: 

t/2-1 t/2-1 t/2-1 

ka*) = n a u(s>x), a„ax) = n ^x), A T ( X ) = n a (^) ^ 

j=-r/2 j=-r/2 3=-t/2 

hence A(x) = ^^A s (x)A u (x). 

The time reversal symmetry implies that W? x = iW x , Wf x = iW x and: 

A T (ar) = A T (ix)- 1 , A StT {ix) = A„ iT (a;) _1 , A UjT {ix) = A^z)" 1 
sma(x) = sin ayix) 

The quantity a(x) = — logA(x) will be called the entropy production per timing event 
so that e~ a ( x ) is the phase space volume contraction per event. 

It seems that a{x) has all the properties that one may wish for the extension to non 
equilibrium thermodynamics of the ordinary entropy, even though not everybody would 
agree on this statement. 

For instance it is well established in many numerical experiments that the time average 
of cr(x) is > and > when the external fields are non zero. In zero field a(x) vanishes 
(an expression of Liouville's theorem). 

Recently the general non negativity of the time average of cr(x) has become a theorem 
(by Ruche) that can be rightly called, I feel, the H-theorem of reversible non equilibrium 
statistical mechanics, [16]. More generally one can consider systems that arc smooth 
only piecewise and extend the notion of SRB distribution so that, if the latter does not 
have nontrivial vanishing Lyapunov exponents, it can be also shown, c.f.r [16], that the 
average of a(x) can vanish on a SRB distribution 2 has a density with respect to the 
volume. 

Therefore we shall call dissipative the systems for which the time average of a(x) is 
positive, [17]. 

We call a T (x) the partial average of a(x) over the part of trajectory centered at x (in 
time): S~ T / 2 x, . . . , S T / 2 ~ 1 x. Then we can define the dimensionless entropy production 
p = p(x) via: 

t/2-1 

*t(x) = ± J2 v(S j x)= f {o-)+P (3.3) 

T j=-r/2 

where (a) + is the infinite time average J c a(y)^i + (dy), if fi + is the forward statistics of 
the volume measure, and r is any integer. 

The chaotic hypothesis implies a fluctuation theorem which, [7], is a property of the 
fluctuations of the entropy production rate in dissipative systems; the (dimensionless) 
finite time average p = p(x) has a statistical distribution 7r T (p) with respect to the 
stationary state distribution /i + such that the following limit exists: 

In general there may be many of them but, by the chaotic hypothesis assumed here, there is one and only 
one. 
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lim -j^- \ogir T {p) = -C(p) (3.4) 



r^oo 



for all p's in the domain (— p*,p*) where p(x) can vary, and: 

C(p)-C(-p) _ 1 



(3.5) 



provided the attractor is dense on phase space cmd the system is reversible. This is the 
fluctuation theorem of [13]: later we indicate its extension to cases in which the attractor 
is smaller than the whole phase space. Under our assumptions the function £(p) exists 
and is real analytic in p e (—p*,p*), [17]. 

The above relation holds in the whole domain of variability of p (which is in general 
a bounded variable because C is a bounded manifold): and therefore it is a surprising 
parameterless prediction. 

The proof of the fluctuation theorem goes to the core of the structure of chaotic systems 
and is very enlightening. The key point is that Anosov (and Axiom A) systems have a 
dynamics that can be easily transformed into a symbolic dynamics, [2]. 

One can in fact find a partition of phase space C into parallelograms. A parallelogram will 
be a small set with a boundary consisting of pieces of the stable and unstable manifolds 
joined together as described below. The smallness has to be such that the parts of the 
manifolds involved look essentially "straight": i.e. the sizes of the sides have to be small 
compared to the smallest radii of curvature of the manifolds and W£ , as x varies in 
C. 

Therefore let 5 be a length scale small compared to the minimal (among all x) curvature 
radii of the stable and unstable manifolds. Let W™' S , W£' S be the connected parts of W", 
W£ containing x and contained in a sphere of radius S. Let us first define a parallelogram 
E in the phase space C, to be denoted by A" x A s , with center x and axes A™, A s with 
A" and A s small connected surface elements on and W£ containing x. Then E is 
defined as follows. 

Consider £ e A u and r\ e A s and suppose that the intersection £ x r\ = W^' S n W™' S 
is a unique point (this will be so if 5 is small enough and if A™, A s are small enough 
compared to 5 as we can assume, because the stable and unstable manifolds are "smooth" 
and transversal). 

The set E — A" x A s of all the points generated in this way when £, n vary arbitrarily 
in A", A s will be called a parallelogram (or rectangle), if the boundaries d A™, dA s of A" 
and A s as subsets of and W£, respectively, have zero surface area on the manifolds 
on which they lie. The sets d u E = A u x dA s and d s E = dA u x A s will be called the 
unstable or horizontal and stable or vertical sides of the parallelogram E. 
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and vertical segments through n and £, respectively, have size 5); the third picture shows the rectangle 
E with the axes and the four marked points are the boundaries dA u and dA s . The picture refers to a 
two dimensional case and the stable and unstable manifolds are drawn as flat, i.e. the A's is very small 
compared to the curvature of the manifolds. Travsversality of , is pictorially represented by drawing 
the surfaces at 90° angles, from [7]. 

Consider now a partition £ = (E\, . . . , £V) of C into N rectangles Ej with pairwise 
disjoint interiors. We call d u £ = Ujd u Ej and d s £ = Ujd s Ej-. these are called respectively 
the unstable boundary of £ and the stable boundary of £, or also the horizontal and vertical 
boundaries of £, respectively. 

We say that £ is a Markov partition if the transformation 5* acting on the stable boundary 
of £ maps it into itself (Sd s £ C d s £ ) and if, likewise, the map S 1-1 acting on the unstable 
boundary maps it into itself (S~ 1 d u £ C d u £). 

The actual construction of the SRB distribution then proceeds from the important 
geometric result of the theory of Anosov systems expressed by what we shall call "Sinai's 
first theorem", [2]: 

Theorem: every transitive Anosov system admits a Markov partition £, as fine as wished. 

The term "fine" refers to the maximum size 5s of the sets E G £ which can be fixed 
a priori to be smaller than any prefixed length ("as fine as wished"). 
The theorem can be extended to imply the existence of more special Markov partitions: 
for instance to show the existence of Markov partitions with any one of the following 
three properties (the last shows that the first two can be realized simultaneouly and will 
play a key role in our analysis): 

(1) The construction of £ can be done, [18], so that the horizontal axes of Ej all lie on 
Wq (and the vertical on Wq) and their union is a set that can be obtained from a single 
small connected surface element A of Wq (rcsp. A of Wq) containing O by dilating 
it with a high iterate of the time evolution S. In other words the union UjA" of 
the horizontal axes of the parallelograms Ej G £ can be regarded as a single connected 
surface which in turn is a good finite representation of the attracting set. 

Likewise the union of the stable axes can be regarded as a large connected part of the 
stable manifold Wq. 

(2) If the reversibility property holds it is clear that i£ is also a a Markov partition. 
This follows from the definition of Markov partition and from the fact that reversibility 
implies: 

W' x = iW? (3.6) 

(3) The definition of a Markov partition also implies that the intersection of two Markov 
partitions is a Markov partition, hence it is clear that there are Markov partitions £ that 
are reversible in the sense that £ = i£ (if this is not true one can just intersect £ and its 
i-image i£). 

Once a Markov partition £ = (E\, . . . , £V) is given one can associate with each point 
x E C the history h(x) = (..., h-i, ho, h\, . . .) of the motion of x on £: it is the sequence 
of the names of the elements of £ that x visits at time j: S^x G Ef lj . 

This is well defined for all x G C which do not visit in their evolution the boundaries 
of the elements of £: the latter points form a set of zero volume and can be disregarded 
for the purposes of our analysis (having zero probability of being ^"randomly chosen). 
The expansion and contraction properties of the sides of the parallelograms imply that 
if the partition £ is fine enough (precisely if the images under S of each E G £ intersects 
any other E' G £ in a connected set) they are small enough only one point can have a 
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given history and two points that have histories with identical symbols hj for \j\ < N 
are close within a distance 0(e~ XN ). 

The symbolic representation is " as good" as the decimal representation of (the cartesian 
coordinates of) the points: provided we shall always suppose the Markov partition to be 
"fine enough" (in the above sense) it follows that the points x e C are determined by the 
digits of their symbolic representation h with an exponential precision, in the sense that 
they get close at exponential rate in the number of agreeing digits. It is however much 
better than the usual decimal representations because it is adapted to the dynamics. The 
map S in fact just becomes the translation of the the history h(Sx) = "&h if $ is the 
left shift, obviously. 

The above are properties that the history of x on £ shares with most fine partitions 
of phase space. There is however a property that is enjoyed by the ordinary decimal 
representations of the cartesian coordinates and that is a priori missing in the symbolic 
representations that are derived from the histories on a generic partition of phase space. 

Namely the ordinary representation of the cartesian coordinates by decimal digits has 
the remarkable property that not only each point can be represented symbolically but 
also any sequence of digits determines a point. 

The latter property does not hold in general for the symbolic representation of a point 
via the history of its motion along a partition. In fact if we define the compatibility 
matrix M hh > for the partition £ by setting M hh > = 1 if SE% n E%, ^ (here E° denotes 
the interior of a set E) and Mhh 1 = otherwise, then h can be the history of a point 
only if Mhjhj+t = 1- We call such sequences compatible by nearest neighbours. 

Note that if a system is a transitive Anosov system the matrix M has the mixing 
property, namely M n has strictly positive matrix elements for all n large enough. 

But it is clear that the compatibility condition is not the only one: one should define 
compatibility conditions linking three neighboring times j,j + + 2 or four and more. 
This shows that constructing a symbolic dynamics representation of a given dynamical 
system is a very non trivial problem. The understanding of the compatibility conditions 
essentially requires a complete solution of the dynamics in order to tell a priori which 
are the symbolic sequences that can actually represent a point, i.e. it requires the con- 
struction of a code of the phase space into a space of sequences. 

What is special about Markov partitions is that we can a priori tell which are the pos- 
sible histories of points in C. They are simply all the sequences compatible by nearest 
neighbours: the space of such sequences will be called Km- This is very remarkable as 
in some sense it solves completely the problem of studying the motions of a transitive 
Anosov system. 

Whatever the transitive Anosov system is the time evolution can be represented simply 
as a shift on the space Km of biinfinite sequences of symbols subject only to nearest 
neighbour compatibility conditions (" subshift of finite type" with a suitable compatibility 
matrix). Therefore the Anosov systems play the same role that the harmonic oscillators 
play in perturbation theory: most systems are not Anosov systems, but since we "fully" 
understand Anosov systems they become the paradigm of non integrable motion just as 
the harmonic oscillators are the paradigm of integrable motions, [2]. 

§^ Volume measure. Forward and backward SRB distributions. 

An immediate consequence of the symbolic dynamics representation of the points of C 
in the cases in which (C, S) is a transitive Anosov system is that the Liouville measure 
(i.e. the volume measure) on C and its statistics are easily expressible in symbolic terms. 

We consider the partition £ T = cF_ T S~i£ obtained by intersecting the images under , 
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j = —T, ...,Tof£. Then £ T is still a Markov partition and it is time reversal invariant 
if £ is. 

We note that the parallelograms of £t can be labeled by strings of symbols h-T, ■ ■ ■ ,hr 
and they consist of the points x such that S x G Eh k for — T < k < T. In other words 
the parallelograms consist of those points x which in their time evolution visit at time k 
the parallelogram (one usually says that these are the points whose symbolic dynamics 
on £ coincide, at the times ("sites") k between — T and T). 

In a parallelogram any point can be regarded as center. However there are special 
points that arc usually taken as centers because they play a special role. Accordingly 
we shall take as center Xh_ T ,...,h T °f a parallelogram Eh_ Ti ...ji T a point whose symbolic 
dynamics string j at the times k with \k\ > T is fixed in a standard way; i.e. by 
defining hk for k > T (respectively k < — T) as a compatible sequence depending only 
on hr (respectively H-t)- We cannot in general make the simple choice of continuing 
h-T, ■ ■ ■ , hr at times k > T or k < —T with a fixed symbol because this may lead to 
an incompatible sequence (hence the choice that we make is "the simplest" possible: it 
leaves some arbitrariness because there are many strings of symbols that start with a 
given symbol. The arbitrariness is however irrelevant for what follows). 

The possibility of the above continuation of finite strings into infinite strings relics on 
the mixing property of the matrix M, consequence of the transitivity of (C, S). 

Another simple choice of the center is to pick a periodic point in Eh_ T ,... t h T : tms can 
be easily done by continuing the string h-T, ■ ■ ■ ,hr beyond T in a standard way (in 
the above sense) to h-T- n , ■ ■ ■ , hr+ n so that M^ T+ri / l _ T _ re = 1 and then continuing it 
periodically. 

We can now define probability distributions on the phase space C simply by defining 
probability distributions on the space Km of compatible sequences with compatibility 
matrix M of a fine Markov partition £. 

The simplest way of constructing probability distributions on /Cm is to think of the 
symbolic sequences as spin chains on a one dimensional lattice (the time in our case) 
and to assign an energy to each chain and construct the corresponding Gibbs probability 
distribution. 

We recall that an energy function for a chain of spins is a function £o(h), to be called 
the energy of interaction of the spin at with the other spins of the chain, which has 
short range, e.g. for some k,n > is is \£o(h) — £o{h')\ < fce~ K l Ar l if h and h' agree on 
the sites j with |j| < N. 

In other words an energy function has short range if it depends on a finite number of 
spins ("Ising chains") or if it depends "exponentially little" on the "far" spins. 
Then one defines a Gibbs distribution with energy functions {ij(h)} by setting: 



/ 



Kdh')F(h') = Hm — — =jjmMf) (4-1) 



.,h T e 



here the infinite sequence h is a "standard" (in the above sense) continuation of the 
string h-T, ■ ■ ■ , hx to an infinite compatible sequence. 

If £j verifies mild translation invariance properties, e.g. if £j( h) = £+{h) for all j > 
and £j( h ) = £-{h) for all j < 0, the limit exits and is unique. 

The metric dynamical system (/Cm, A) nas very strong cluster properties, for instance 
it is exponentially mixing in the sense that, if (-)^ denotes /i-average the correlation: 
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(F(S^)F(-)}^ (F(^.) M F(.))^ 7 - I ->0 (4.2) 

and the convergence to zero is exponentially fast, for all functions on JCm with short 
range in the above sense. 

One can construct in this way three gibbsian probability distributions that we call 
/UOiM+jA-- If h. * — >x (lk) is t ne code that associates with each compatible history h G 
/Cm the point x e C with that history, the distributions fi, fi± are the Gibbs distributions 
with respective potentials: 

, , _ f X+(h) = logA u (x{h)) 
A -> \X-(h) = -logA s (x(h)) 

£j(h)=X + (h) 

£ j (h)=X_(h) 

Since the map h — > x(h) is such that d(x(h), x(h')) < Be~ XN if the histories h, h' 
agree between time —N and time N, and since the functions A u (x), A s (x) are Holder 
continuous (see §1), it follows immediately that lj(h) have short memory and therefore 
they define Gibbs states /2o,M+,M-- 

The above distributions fi can be coded back to distributions on C simply by using the 
code h — > x(h) between JCm and C and setting: fi(x(G)) = fi(G) for a generic set G. 
The following theorem by Sinai is the fundamental result, [2]: 

Theorem 2: Coding back into distributions on C the distributions fio,jl± one obtains 
three distributions /Uo,/U + ,At_. The first fi is proportional to the volume, and in general 
it is not S -invariant, while the two others are the forward and backward statistics of the 
volume. 



if J > 
if j<0 



(4.3) 



Thus this theorem solves completely the problem of the existence of the SRB distribu- 
tions ju)±. It also gives us a concrete and usable representation for the SRB distribution 
via (4.3) and (4.1). 

The choice of the "standard" continuation of h-T, ■ ■ ■ , hr to h e Km in (4.1) is quite 
arbitrary: but the above theorem implies that it is immaterial. In particular a very 
popular choice is the periodic continuation which leads to the alternative representation 
of the SRB distributions fi + and ,u_ known as the periodic orbit expansion. Although 
usually (and inexplicably) surrounded by an aura of mystery this is a representation that 
is easier to communicate to unfavourable audiences as it does not require the symbolic 
dynamics analysis (on which, however, it rests): 



I 



F(x)„ + (dx) = km ^™x = ,JX*)e {4A) 



which is a formula that is an immediate (non trivial) consequence (if one takes for granted 
the theory of one dimensional Gibbs states) of the previous (4.1), (4. 3). 
The periodic orbit representation of measures on on C has the disadvantage that the 
volume distribution /io does not have a natural representation: only /z+ (and /i_) are eas- 
ily represented. This is a drawback as (4.1), (4. 3) make very transparent the relationship 
between fiQ, fi± and makes it obvious why fi± are the forward and backward statistics of 
Mo- 

It is also possible to write (4.1) with £j — X + "directly" as a distribution on C as: 
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/ 



^.e-EL'-T^)^')^.) 
, + (dx)F(x) = lim MF) = T lim ^T^~^i^ ( 4 ' 5 ) 

where the sum runs over the elements Ej of the pavement S T £ V ... V S~ T Eh T , h is 
the extension of (/i_t, • ■ • , hr) to a string in JCm and £j = x(h). The function F is any 
smooth (i.e. Holder continuous) function on C. 
The relation (4.5) is also written as: 



/ 



M+ (dx)F(x) = lim % ■ 3 / = lim MF) (4.6) 

because by definition the jacobian determinant A u ^(x) of the jacobian of the map S* as 
a map of Wg_ t/2x to W^ t/2x is, see (3.1), exp £* fc {f ^ A+^XS^Xj). 

We now show informally why (4.6) implies the fluctuation theorem, [13], [7], [17]. 

We first evaluate the probability, with respect to fi T / 2 of (4.6), of a T (x)/(a) + G I p 
divided by the probability (with respect to the same distribution) that a T (x)/(a)+ G I- p . 
This is: 

Z)j,a T Qrj)=p A uA X j) ^ ^ 

£j, a T (xj)=-p A u,r( x j) 

Since ^ T / 2 in (4.6) is only an approximation to fi + an error is involved in using (4.7) as 
a formula for the same ratio computed by using the true /z+ instead of (J, T /2- 
It can be shown that this error can be estimated to affect the result only by a factor 
bounded above and below uniformly in r,p, [13]2. This is a remark technically based on 
the thermodynamic analogy pointed out in (4.1) above. 

We now try to establish a one to one correspondence between the addends in the nu- 
merator of (4.7) and the ones in the denominator, aiming at showing that corresponding 
addends have a constant ratio which will, therefore, be the value of the ratio in (4.7). 

This is possible because of the reversibility property: it will be used in the form of its 
consequences given by the relations (3.2). 

The ratio (4.7) can therefore be written simply as: 

EE 3 -,o T (xj)=p ^-u,r( x i) _*52iE j ,a T (x j )=p^-u,T\ x i) . . 

(4.8) 



T, Ej ,a T (x } ) = -p A u,r i x J ) EiJ 3 ,a T (x 3 )=p A s,t { x j ) 

where Xj G Ej is the center in Ej. In deducing the second relation we make use of the 
existence of the time reversal symmetry i and of (3.5), and assume that the centers Xj,Xj> 
of Ej and Eji = iEj are chosen so that Xji = ixj. 

It follows then that the ratios between corresponding terms in the ratio (4.8) is equal 
to A U T (x)A S T (x). This differs from the reciprocal of the total change of phase space 
volume over the r time steps between the point S~~ T l 2 x and S r l 2 x only because it does 
not take into account the ratio of the sines of the angles a(S~ T l 2 x) and a(S T / 2 x) formed 
by the stable and unstable manifolds at the points S~ T l 2 x and S T ^ 2 x, see footnote 1 . 
But A UT (x)A S T (x) will differ from the actual phase space contraction under the action 
of S T , as a map between S~ T l 2 x and S T ^ 2 x, by a factor that can be bounded between 
B^ 1 and B with B — max T T i \ sm al f}\ which is finite by the transversality of the stable 

x ' x |sina(x)| J J 

and unstable manifolds. 
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Now for all the points Xj in (4.8), the reciprocal of the total phase space volume change 
over a time rt is e' 7 -<-( x i)( c ')+ T , which (by the constraint imposed on the summation labels 
a T /(a) + = p) equals e T ^+ p . Hence the ratio (4.7) will be e T ^+ p . It is important to 
note that there are two errors ignored here, as pointed out in the discussion above. They 
imply that the argument of the exponential is correct up to p, r independent corrections 
(which are in fact observed in the experiment as fig. 3 of [19] shows, or as it is shown by 
Fig. 2 below). One should note that other errors may arise because of the approximate 
validity of our main chaotic assumption (which states that things go "as if the system 
was Anosov): they may depend on N and we do not control them except for the fact 
that, if present, their relative value should tend to as N — > oo: there may be (and 
very likely there are) cases in which the chaotic hypotesis is not reasonable for small N 
(e.g. systmes like the Fermi-Pasta-Ulam chains) but it might be correct for large N. We 
also mention that for some systems with small N for which the chaotic hypothesis may 
be already regarded as valid (e.g. model 1 with N = 1 in [20]). 

The p independence of the ratio (3.5) is therefore a key test of the theory (and it should 
hold with corrections of order 0(r _1 ) if is evaluated by its finite r approximation 

Cr(p))- 

The fluctuation theorem is not immediately applicable in many cases. For instance in 
the cases in which the attractor is an axiom A attractor smaller than the whole phase 
space. 

If however we imagine that the chaotic hypothesis holds in the stronger sense that the 
system verifies the Axiom C then we can use that the map i that maps the backward 
attractor A- into the forward attractor A+ and combine it with the time reversal sym- 
metry i to generate a map i* = li which maps the two poles A+ and A- into themselves 
(see §2). The new map i* is a time reversal symmetry that anticommutes with the time 
evolution, i.e. Si* =i*S~ x leaving A+ invariant. 

Thus the fluctuation therem holds for such systems because we can think that (A+ , S) 
is a reversible transitive Anosov system to which we can apply the above arguncnts. 

Of course if we take the above viewpoint the phase space contraction that the theorem 
referes to should no longer be the contraction of the total volume but just the contraction 
of the surface measure of A+ which will now play the role of hq. 

The latter result might be difficult to use as the attractor A+ is very likely to be an 
unreachable object. And in fact to be applied new properties must hold. It is remarkable 
that such poperties had already been observed in some special systems, much earlier than 
the fluctuation theorem. We discuss the applications of the theorem in §4. 

Note that the theorem is a large fluctuation theorem: therefore it is very hard to test. 
The measure of how large are the fluctuations can be grasped also by the attempts that 
have been proposed in the literature to link the phenomena that occurr in models like 
the first one in §2 and the Loschmidt paradox, [21]. Clearly the deviations of p from the 
average are large deviations that, in order to be neasured, certainly involve observations 
of currents opposite to the field and violating the second law. 

§5. Applications: reversible conduction, fluids. Onsager reciprocity. Dynamical ensem- 
bles. Irreversible systems. 

The first applications are numerical tests. The basic result is [19] which was the origin of 
the whole story: the chaotic hypothesis was formulated to find a theoretical explanation 
of the results of this important experiment. The latter authomatically provides a first 
test of the hypothesis. 

Experiments designed to test the hypthesis have since been performed. I quote here 
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the first of them, [14], that studies the reversible conduction model in example 1, §2. 
The following graphs are the results of tests of the fluctuation theorem in a 2 or 10 
particles system of hard spheres enclosed in a periodic box containing two hard sphere 
fixed obstacles and subject to a very large electric field (E = 1 which is huge in physical 
units if one thinks of the particles as electrons in a crystal). 

The following are the graphs of the empirically determined x{p) = {({—p) — C(p))/(< 7 )+• 
The case of 2 particles and semiperiodic boundary conditions yields: 



x(p) 



t=20 



x(p) 



...» ,,, 



= 100 



> t 



1 



V 



0.0 0.5 



1.0 



1.5 



V 



Fig.l: The graphs for the fluctuation theorem test N — 2; the dashed line is the fluctuation theorem prediction, 
r — 20, 100. The arrows mark the point at distance 
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And the case of 10 particles and semiperiodic boundary conditions gives: 
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Fig. 2: The linear fluctuation test, r = 20, 40, 80, 100 for N = 10. The dashed line is the fluctuation theorem 
prediction for r — +oo. The arrows mark the point at distance ((p — l) 2 ) from 1, from [14]. 



According to the theorem the graphs should be a straight line with slope 1. The graphs 
show the data as well as the theoretical line with slope 1 (dashed). The function £(p), 
hence x(p), in fact depends on r, the time interval over which the fluctuations are ob- 
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served and should be more properly denoted Cr(f>)- We show the graphs of x{p) versus 
p for various r: the slope 1 is exact only for r = +00 but we see that it can already be 
observed for reasonable r-values. 

The experiments in [14] showed various other interesting phenomena: namely the fact 
that for large fields the attractor is certainly smaller than the full phase space. 

The interpretation of the fact that in spite of that the experimental results still matched 
with (3.5), generated the idea of strengthening of the chaotic hypothesis by replacing the 
assumption that the system has attractors that can be regaded as Anosov systems by 
the stronger assumption that the system verifies Axiom C, which should have a general 
validity in time reversible systems: see [6]. 

If (C, S) is reversible and verifies Axiom C the map % defined via geometric considerations 
in §2 will commute with i (by the covariance under i of the manifolds used to build it) 
and therefore the map 11 — i* will leave invariant the poles Cl± of the system and on 
them it will anticommute with S. 

This means that even though time reversal symmetry is " lost" or " broken" on f2± there 
is another transformation (i*) leaving fi± invariant and anticommuting with S, i.e. which 
plays the role of a time reversal symmetry for the restriction of S to f2+ (and fi_). 

This implies that in reversible Axiom C systems time reversal is undestructible: if the 
attractor becomes smaller than the full phase space one can say that the time reversal 
symmetry is spontaneously broken. But on the attractor one can still define a map i* =/= i 
which anticommutes with time and leave the attractor invariant. This is the "real" 
time reversal for the attractor, or the local time reversal, see [6]: while the "true" time 
reversal appears as a broken symmetry to an observer that only observes motions that 
are attracted by fi+. 

A situation reminiscent of the violation of time reversal in elemntary particles physics, 
where the time reversal symmetry, or T symmetry, is broken but there is another sym- 
metry that chab=nges the sign to time, the TCP symmetry. Here i plays the role of T 
and i* the role of TCP (and 1 that of CP). 

Hence the volume on the attractor will contract obeying the fluctutation theorem. This 
in itself is not sufficient to study the phase space volume fluctuations because, as said 
above, one does not have direct access to the surface area of the attractor (and to the 
attractor itself, as a matter of fact). 

Then manifestly one would be back with an Anosov system (on a lower dimensional 
manifold) and a version of the fluctuation theorem would still hold. Furthermore one 
could say that this is only a different interpretation of the chaotic principle. 

If this picture is correct we can write the phase space contraction rate (see (3.2) a(x) — 
cto(x) + a±(x) where ctq{x) is the contraction rate on the surface on which the attractor 
lies and a±(x) is the contraction rate of the part of the stable manifold of the attractor 
which is not on the attractor itself (the angle between the part of the stable manifold 
sticking out of the attractor and the attractor itself is disregarded here as we think that 
it is bounded away from and it since the attractor is compact). 

Local time reversal will change the sign only ofao(x) and the fluctuation theorem should 
apply to the fluctuations of a . But cto(x) is not directly accessible to measurement: 
nevertheless we can still study its fluctuations via the following heuristic analysis, [14]. 

Considerable help comes from a remarkable theorem that was discovered experimentally 
in [22], [23]. In the full phase space of the equations in the reversible example 1) in §1 
the Lyapunov exponents verify a pairing rule. Namely if 2D = AN — 2 is the number of 
exponents and the first D = 2N — 1 exponents, Xf , . . . , A^ v _ 1 , are ordered in decreasing 
order and the next 2N — 1, X[ , . . . , A 2Ar _ 1 , are ordered in increasing order then: 
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const for all j = 1, . . . , D 



(5.1) 



The constant will be called "pairing lever or "pairing constant': it must be + . 

The pairing rule, in fact, formally holds in the present example as a consequence of [24]. 

In the cases in which (5.1) has been proved, [24], it holds also in a far stronger sense: 
the local Lyapunov exponents, of which the Lyapunov exponents are the averages, are 
paired as in (3.2) to a constant that is j independent but, of course, is dependent on the 
point in phase space. We call this the strong pairing rule. From the proof in [24] of the 
pairing rule one sees that the jacobian matrix J = dS of the map S is such that V 'J* J 
has D pairs of eigenvalues (D = 2N — 1 in the case of example 1, §2) and the logarithms 
of each pair add up to cr(x) (see also (2. 4), (2. 5)). 

The simplest interpretation of this, consistent with the proposed attractor picture, is 
that pairs with elements of opposite signs describe expansion on the manifold on which 
the attractor lies. While the M < D pairs consisting of two negative exponents describe 
contraction of phase space transversally to the manifold on which the attractor lies. 

Then we would have o~o(x) = (D — M)o~(x) and we should have a fluctuation law for 
the quantity p(x) associated with o~o{x) defined by (2.6) and (2.7) with uq replacing a, 
i.e. (accepting the above heuristic argument, taken from [14]): 



(D — M) 
o-0t(x) = {o-)+p{x) 



i.e. a law identical to (2.10) up to a correcting factor 1 



M. 

d ■ 



pr{a). 



log 



Tr(-P) 



(1 



M 
~D 



)P 



(5.2) 



(5.3) 



The graphs of Fig. 2 are relative to an experiment in which we see that there may be one 
negative exponents in excess over the positive ones as said above. 
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Fig. 3 The 38 Lyapunov exponents, N — 10. Small picture magnifies the tail of the larger, showing better 
the pairing rule and that the 19-th exponent is slightly negative, from [14]- 
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In fact a study of the exponents values in the case of 10 particles and semiperiodic 
bounday conditions gives the above diagram in which the Lyapunov exponents corre- 
sponding to E = 1 are drawn as a function of the field E. Which shows a very small 
exponent which is the higher member of a pair and, yet, is negative. The graphs of Fig. 
2, however, show that the agreement of (4.1) with the experiment is within the errors: 
had there been no negative exponents we would have expected a slope 1. If there is one 
negative exponent in excess we expect a slope 1 — yg which is within the error bars in 
Fig. 2. An excess of 2 exponents would yield a slope of 1 — ^ which is out of the error 
bars. 

Note that since the exponent smallest in modulus is so small we must expect that 
it yields a clear effect only after extremely long times have elapsed {i.e. for values of 
t > 4. 10 3 : totally out of computability). At larger fields the number of negative pairs 
increases considerably: but the fluctutation theorem cannot be tested as the fluctuations 
become too improbabble to be measurable, even with the largest computers. 

The reversible case of example 2) has not been studied. Nor the reversible case of model 
3). Studies are being performed at least for the model in example 3). 

More generally, however, we do not expect the pairing rule to hold. For instance this 
is clear in the case of the second example in §2: in that case the Lyapunov exponents 
relative to the thermostat are obviously paired to while those of the conduction particles 
have a negative total sum (by the H-theorem of Ruelle, [16], quoted before (3.3)). 

Nevertheless some kind of pairing might still occurr. In such cases one could envisage 
that (5.1) is replaced by a relation like: 

1 - 3 ={a) + -L 5.4 
2 w+ 2D 

where {a) is the fi + average of the phase space contraction per unit time; and Cj is some 



d e f 

suitable function of j, while D = ^2 

We therefore define Cj by the (5.4) without attempting at determining them a priori. 
Then one can think that (5.4) holds in a "almost local" form in the sense that on a rapid 
time scale (5.4) becomes true also for the local exponents. This means that, up to an 
error that tends to zero very quickly with the time r, the logarithms of the eigenvalues 
of the matrix {J^{x) J T {x)) 1 / 2t , with J T {x) being the jacobian matrix for the evolution 
operator V T at x, divided by r verify ^(A^ + Xj) = cj/3 r {x) with f3 T {x) denoting the 

average Ej= 2 -~ T /2 P{S 3 x) dt. 

This property, together with the Axiom C assumption, will then suffice to extend, in 
a suitable form, the validity of the predictions of the fluctuation theorem based on the 
pairing rule {i.e. to cases in which the attractor is smaller than phase space). 

We first remark that the really relevant feature of the pairing rule, as far as the above 
applications are concerned, is not the constancy of the pairing but, rather, the fact that 
some kind of pairing takes place on a fast enough time scale, see [10]. 

Then if a local time reversal exists on the attractor {i.e. if the geometric Axiom C is 
assumed as well for the dynamics the fluctuations of the observable a will have a "free 
energy" (or a "generalized sum of Lyapunov exponents" to adhere to the terminology in 

y cj 

[25], [26]) ({p), in the sense of (3.4), with an odd part pP (<r) + , with P equal to 1 — ^ — 

where the J2- runs ver the values of the j's to which correspond two negative Lyapunov 
exponents: 
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C(-P) - C(p) = pdef 1 _ E 



M+p E< 



(5.5) 



This is a property whose validity can be conceivably tested in, real or numerical, exper- 
iments. At least the linearity in p of — (C(p) — C(~P))/( (T )+ with a slope < 1 should be 
observable. 

Another application is the already mentioned relation between the fluctuation theorem 
and the Onsager reciprocity. 

We imagine a system even more general than the ones in §2. We assume that it is 
a reversible dissipative system with several forces G — (G±, . . . ,G S ) acting upon the 
particles (or on the fluid in case of fluid mechanics models). 

The G are parameters, with dimension not necessaribly being that of a force, measuring 
the strength of the various causes of noncquilibrium (e.g. they could be electromotive 
fields as abve, or temperature differences or other as in the references [27]). We follow 
here the analysis in [28] rather than the one in [27]. 

We suppose that the phase space contraction rate can be written in increasing powers 
of G: 

a^^G^H^ + OiG 2 ) (5.6) 

i=l 

this simply means that we suppose that at zero forcing, G = , there is no phase space 
contraction and, in fact, we always suppose that in such case the system is conservative 
(although this is not really necessary) and ^ + |g = o is time reversal invariant. 
The interpretation of a(x) as a microscopic density of entropy production allows us to 
establish in a unambiguous way the duality between thermodynamic fluxes or currents 
and thermodynamic forces. 

We define the current associated with the force Gi by Ji(x) — 9g ; ct(x) and the transport 
coefficients by setting Lij = da i (Jj(x)) + | g=o an d we study L y -. We want to show that 
the above ideas suffice to prove Onsager's reciprocal relations i.e. Ly = Ljj. 

The analysis is based on the remark that the fluctuation theorem can be extended to give 
properties of the joint distribution of the average of a, (5.6), and of the corresponding 
£i + -average of GjdcjCr. 

In fact we shall define the dimensionless current associated with the force Gj by q = 
q(x): 



- f ^ Gjd Gj a(S t x)dt =' Gj (d Gj a)+ q (5.7) 

r J —r/2 



where the factor Gj is introduced here only to keep a and Gjda j o' with the same dimen- 
sions. 

Then if w T (p, q) is the joint probability of p, q the same proof of the fluctuation theorem 
discussed above yields also that, if we define: 

lim - log7r T (p, q) = -C(p, q) (5.8) 

then: 

C(-P,-g)-C(P,g) = ! (59) 
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The proof in [17] shows that q) is analytic in the interior of the domain of variability 
of p, q at least if this domain is open. The latter condition means that p, q are independent 
variables for all G small enough. 

This is the "normal" case: it is violated essentially only if the forces are not really 
different although they are given different names, e.g. one could have only one force 
G\ acting on the system but one could regard it as two equal forces. In this case a 
and G\d Gl a would coincide (at least to first order in G) and the p, q would not be 
independent. 

We can compute ((p, q) as usual in statistical mechanics: by considering first the trans- 
form A (ft, ft): 

A(ft = lim -log / e T ^ {p - 1){a)++h{q - 1)(GidG i a)+ \ T (p,q)dpdq (5.10) 

T— >OG T J 

and then the Legendre trasform: 

C( P ,q) = max (ft (p-l)(a) + +p 2 (q-l)(G j d Gj cT) + - A(ft) (5.11) 

Pl,P2 — 

The function A(/3), (3 — (ft, ft), is evaluated by the cumulant expansion. For the 
purpose of shortening the notation introduce X(x) as: 

t/2-1 t/2-1 

X(x) =ft - (*>+) +ft E {GjdM&x) - (Gjdjvh) (5.12) 

t=-r/2 t=-r/2 

so that X has /u+-average and the function A( (3 ) is simply written as: 

e rA( ^ = J e x ^n+{dx) = (e x )+ (5.13) 

But (X) + = by the definition so that the cumulant expansion for J e x ^fi + (dx) yields, 
forgetting 0(G 3 ): 

fe x ^fx+{dx) = e * <x2) + (5.14) 

and (X 2 } + is a quadratic form Cft (3) in /3 (because X is linear in ft with coefficients 
given by the cumulants: 

oo 

c 11 =E(M 5t -M0>+-K-)>y 

— oo 
oo 

C 22 =J2 ({G^oiS 1 ) ■ Gjd Gj a(-)) + {Gjd Gi a{-))l) (5.15) 

— OO 

oc- 

C 12 =C 21 = ■ Gjd G A-))+ (Gjd Gj a(.)) + ) 

— oo 

The four cumulants form a symmetric matrix that will be called C. 
The functions A( (3 ) and ((p, q) are related via a Legendre transform: 

C(p,q)=msx(0-w-\([})) (5.16) 
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where w~ (P- 1 ^ + 

So that the maximum condition yields the equations for the value of (3 where the 
expression in parenthesis in (5.16) reaches the maximum (3 = /3 max : 

-!){&)+ =d Pl \{p)\ p=p 

- - -max / 51 ^ 

{ q -l)(G J d 3 a) + =d P2 \(P)\ _ = p_ 

so that, always performing the computations by neglecting quantities of 0(G 3 ) (i.e. by 
using A(£) = ±(C£,£)): 



w = 



and from (5.16), evaluating the r.h.s. at the maximum point j3 , we get: 



CM = UL • £ max - HP m J = \{w,C-^) (5.19) 



^ = ^(31 ~n 12 ) (5-20) 



Since: 



det C \ —C21 Cn 
we see that (recall that G is symmetric): 

C(p,«) = ^(Oii(p-1)V>++ 

+ ^(C- 1 ) 22 (5 - l) 2 ^-*)* + (Oufo - 1)(P- 1W 
and the odd terms in (p, g) have coefficients: 

9 - (-2(C- 1 ) 22 G J (d Gj a) + -2(C- 1 ) 12 (v) + )G J (d Gj a) + =Q 
p (-2(C- 1 ) 11 M + -2(C- 1 ) 12 G,(a Gj a) + )( ( r) + ) = -( ( T) + 



(5.21) 



(5.22) 



where the r.h.s. arise by imposing compatibility with the fluctuation theorem relation 
(5.9). 

The above two relations, after some simple algebra, are seen to imply: 

(a> + =icu + 0(G 3 ) 

2 (5.23) 
(G J 9 G ^) + =-C 12 + 0(G 3 ) 

Then expanding both sides of (5.23) to lowest order (i.e. lowest non trivial, the second 
in this case) in the Gj's we get the Onsager relation and the Green Kobo formula. 
In fact first we look at the r.h.s. of the first relation in (5.23): in C\\ one simply replaces 
a, see (5.15), by its expansion to first order, (5.6), and the r.h.s. becomes a quadratic 
form in G with coefficients given by: 

- J dt((Jf{S t -)j9(.)) + (J°) + (J°)+)| G=0 (5.24) 
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On the other hand the expansion of (cr)+ in the l.h.s. of the first of (5.23) to second 
order in G gives: 

(<7) + = ^G i a Gi (<7) + | + ^G i G j (9 Gi (a Gj .(<7) + )| (5.25) 

ij 

the first term vanishes (by time reversal, or just because it is clear from the r.h.s. of 
(5.23) that (a) + is of second order in G). The second term is the sum of jGiGj times: 

d Gi d Gj J (j{x)ii+{dx) = j (d Gi d Gj a{x)n + {dx) + ^ 
+ d Gi (T(x)d Gj n + (dx) + d Gj a(x)d Gi iJ, + (dx) + a(x)d Gz d Gi ^i + (dx)) 

evaluated at G = 0; and the first term in the r.h.s. vanishes (by time reversal it changes 
sign, while ^+ is invariant) the second and third terms are dj < Jf >+ |o + {i* — >j) and 
the fourth vanishes (because a = at G = 0): note that (d Gj < Jf >+)|o is equal to 

{d G] < Ji >+)| ). 3 

Therefore by equating the r.h.s and the l.h.s. of (5.23), we get from the first of (5.23) the 
above expression (5.24) for the matrix L ' 3 ^ Lj ' giving Green-Kubo's formula for i = j 
(but not the Onsager reciprocity nor the general Green-Kubo formula which would say 
that Lij equals (5.24)). 

The same type of analysis on the second of (5.23), which, unlike the first relation, is not 
symmetric in the G 's as j is privileged, leads to the " asymmetric relation" : 

L ji = d Gj {d Gi o)\ G = a _ = - ((J?{Sr)j9{-)) + -(J?) + (J°) + )\ G=0 (5.27) 

t— — oo 

which gives the general Green-Kubo formula, hence Onsager reciprocity and the fluctu- 
ation dissipation theorem. 

Thus the Onsager relations are a consequence of the fluctuation theorem (not surpris- 
ingly) and of its (obvious) extension, (5.9), in the limit G — > 0, when combined with 
the cumulant expansion for entropy fluctuations. Those theorems and the fast decay of 
the crcr correlations are all natural consequences of the chatic hypothesis in reversible 
statistical mechanical or fluid mechanical systems. 

Therefore while the Onsager reciprocity and Green-Kubo formulae (or fluctuation dis- 
sipation theorem) only hold around equilibrium, i.e. they are properties of G-derivatives 
evaluated at G = 0, and the cumulant expansion for A(/3) is a general consequence of 
the correlation decay in Anosov systems, the fluctuation theorem also holds far from 
equilibrium, i.e. for large G, and can be considered a generalization of the Onsager 
relations and of the Green-Kubo formulae, [28]. 

Evidence for the above interpretation of the fluctuation theorem arose in [14] in an effort 
to interpret the results of various numerical experiments and an apparent incompatibility 



^ This is again the same argument: 



dj < Ji >+ |o = 9j ( / di<j(x)n + (dx))\ — ( / djdia(x)fj, + (dx))o + ( I dicr(x)dj[i+ (dx)) ■■ 
= + (J d t a(x)\od :i ti + (dx)\o = dj < J° > + )o 
where the last step uses that j9 being evaluated at G — does not depend on G. 
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of the a priori known non gaussian nature of the distribution ir T (p) and the " gaussian 
looking" empirical distributions. 

The next question is what we can say about the irreversible models? The key is the 
already mentioned conjectured equivalence of the corresponding SRB distributions. 

The equivalence conjectures seem to raise the possibility of a genral theory of non 
equilibrium statistical ensembles and of their equivalence. 

The idea of using reversible equations to study rreversible behaviour is quite appealing 
as we have seen that there are by now quite a few results that can be obtained for 
reversible dissipative evolutions. Hopefully the theory will grow and we shall learn to use 
the chaotic hypothesis and the ensuing characterization of the SRB distributions. After 
all in equilibrium statistical mechanics it took a long time to develop the theory from 
the heat theorem to the universal phenomena at criticality. 

Unfortunately the equivalence of the ensembles does not allow us to make predictions 
based on the fluctuation theorem about the fluctuations of the entropy production in 
irreversible systems, even when equivalent to reversible ones. This is so because the 
entropy production is a "global" observable that in the irreversible models is ususally 
fixed a priori to a given value or, at least, to a value related to a quantity fixed a priori. 

Thus the fact that we cannot translate knowledge of the entropy fluctuations in reversible 
models to irreversible equivalent models becomes analogous to the impossibility of trans- 
lating information on the energy fluctuations in the canonical enseble to informations (of 
any interest) on the energy fluctuations in the microcanonical ensemble. 

Thus although equivalence can be tested, we have, so far, no special predictions to 
mention aside the obvious one that one should get the same result when measuring 
corresponding quantities (a rather non trivial and interesting property): the fluctuation 
theorem is not directly usable beyond the fact that it implies the Onsgaer reciprocity. 

In [10] some applications have been proposed for the analysis of the models in the 
example 3) of §2 and they may be tested, perhaps, experimentally. But this is beyond 
the scopes of the present review. 

The equivalence conjecture on the other hand gives us an opportunity to test how good 
the approximations of irreversible equations for macroscopic phenomena are. 

In fact if a measurement (a hypothtical one as so far no measurements of this type seem 
to have been performed) of the entropy fluctuations agrees with the fluctuation theorem 
then this would mean that the reversible models are better models (compared to the 
irreversible ones) of the system as they give the same predictions for normal observables 
but for the entropy production fluctuations they give different predictions (compared to 
the ones of the irreversible models). 

Appendix Al. An example of Axiom C system. 

We give here an example, taken from [6], in which i*, the local time reversal, arising in 
the applications can be easily constructed. The example illustrates what we think is a 
typical situation. The poles Q±, £!_, will be two compact regular surfaces, identical in 
the sense that they will be mapped into each other by the time reversal i defined below. 

If x is a point in M* = fi + the generic point of the phase space will be determined by 
a pair (x, z) where x € M* and z is a set of transversal coordinates that tell us how far 
we are from the attractor. The coordinate z takes two well defined values on Q + and f2_ 
that we can denote z+ and z_ respectively. 

The coordinate x identifies a point on the compact manifold M* on which a reversible 
transitive Anosov map 5* acts (see [18]). And the map S on phase space is defined by: 
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S(x, z) — (S*x, Sz) 



(Al.l) 



where S is a map acting on the z coordinate (marking a point on a compact manifold) 
which is an evolution leading from an unstable fixed point z_ to a stable fixed point 
z+. For instance z could consist of a pair of coordinates v,w with v 2 + w 2 = 1 (i.e. z 
is a point on a circle) and an evolution of v,w could be governed by the equation v = 
—av, w = E — aw with a = Ew. If we set Sz to be the time 1 evolution (under the 
latter differential equations) of z — (v, w) we see that such evolution sends v — > and 
w — > ±1 as t —* ±oo and the latter are non marginal fixed points for S. 
Thus if we set S(x, z) = (S*x, Sz) we see that our system is hyperbolic on the basic sets 
il± = M* x {z±} and the future pole f2+ is the set of points (x, z+) with x € M»; while 
the past pole f2_ is the set of points {x,zJ) with x <G M*. 

Clearly the two poles are mapped into each other by the map i(x, z±) = (i*x, z T ). But 
on each attractor a "local time reversal" acts: namely the map i*(x, z±) — (i*x, z±). 

The system is "chaotic" as it has an Axiom A attracting set with closure consisting 
of the points having the form (x, z+) for the motion towards the future and a different 
Axiom A attracting set with closure consisting of the points having the form (x, Z-) for 
the motion towards the past. In fact the dynamical systems (Q+,S) and 5) obtained 
by restricting S to the future or past attracting sets are Anosov systems because fi± are 
regular manifolds. 

We may think that in the reversible cases the situation is always the above: namely there 
is an "irrelevant" set of coordinates z that describes the departure from the future and 
past attractors. The future and past attractors are copies (via the global time reversal 
i) of each other and on each of them is defined a map i* which inverts the time arrow, 
leaving the attractor invariant: such map will be naturally called the local time reversal. 

In the above case the map i* and the coordinates (x, z) are "obvious". The problem is 
to see that they are defined quite generally under the only assumption that the system is 
reversible and has a unique future and a unique past attractos that verify the Axiom A. 
This is a problem that is naturally solved in general when the system verifies the Axiom 
C of §1. 

In the following section we shall describe the interpretation of i* in terms of symbolic 
dynamics when the system verifies Axiom C: as one may expect the construction is 
simple but it is deeply related to the properties of hyperbolic systems such as their 
Markov partitions. 

Appendix A 2. Ising model analogy. 

The fluctuation theorem as expressed by (2.10) and the subsequent comments on the 
Gaussian nature of the function n T (p) may seem somewhat strange and unfamiliar. 

It is therefore worth pointing out that the phenomenon of a "linear fluctuation law" 
on the odd part of the distribution, in the sense of (4.1), without a globally Gaussian 
distribution, is in fact well known in statistical mechanics and probability theory. And 
an example, proposed in [14], of what the fluctuation theorem means in a concrete case, 
in which n T (p) is not Gaussian, can be made by using the Ising model on a 1 dimensional 
lattice Z. 

We consider the space C of the spin configurations a = {o^}, £ G Z and the map S 
that translates each configuration to the right (say). The "time reversal" is the map 
i : {er} — > a} that changes the sign to each spin. 

The probability distribution that approximates the SRB distribution is the finite volume 
Gibbs distribution: 
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~ normalization 

where A = [— T, T] is a large interval, J,h > 0. The configuration a outside A is 
distributed independently on the one inside the box A, to fix the ideas. 
Calling (m) + the average magnetization in the thermodynamic limit we define the mag- 
netization in a box [— §] to be M T = r(m)+p and we look at the probability distribution 
7r^(p) of p in the limit T — > oo. The Gibbs distribution corresponding to the limit of 
(A2.1) will play the role of the SRB distribution. Calling this limit probability n T (p) it 
is easy to see that: 

n r(p) _ 2r/i<m>+p 



e 2rh{m) + p (A2.2) 



This is in fact obvious if we take the two limits T — > oo and r — > oo simultaneously by 
setting T = |. In such a case, if E CT ;p denotes summation over all the configurations 

with given magnetization in [— T, T], i.e. such that EjL-x <7j = (m) + p the distribution 
(A2.1) gives us immediately that: 



e 2rh(m)+p ^ 23 ^ 



T J 



if we use the symmetry of the pair interaction part of the energy under the "time reversal" 
(i.e. under spin reversal). 

The error involved, in the above argument, in taking T = ^ rather then first T — > oo 
and then r — > oo, can be easily corrected since the corrections are "boundary terms", 
and in one dimensional short range spin systems there are no phase transitions and 
the boundary terms have no influence in the infinite volume limit (i.e. they manifest 
themselves as corrections that vanish, as T — > oo followed by t — > oo). 

One may not like that the operation i commutes with S rather than transforming it 
into S^ 1 . Another example in which the operation i does also invert the sign of time is 
obtained by defining i as {j£}| 3 = — u-f the (A2.3) can be derived also by using this 
new symmetry operation. 

The above examples show why there is a priori independence between any Gaussian 
property of 7r T (p) and the fluctuation theorem. The theory of the fluctuation theorem in 
[13] is in fact based on the possibility (discovered in [2]) of representing a chaotic system 
as a one dimensional short range system of interacting spins (in general higher that i); 
and the argument is, actually, very close to the above one for the Ising model with, 
however, a rather different time reversal operation. See [17] for mathematical details on 
the boundary condition question. 
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